AITopics | bias term

f0552f14388d95b19740dee809f5cad1-Supplemental-Conference.pdf

Neural Information Processing SystemsApr-30-2026, 06:25:49 GMT

artificial intelligence, machine learning, neuron, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.51)

Add feedback

Computational Complexity of Learning Neural Networks: Smoothness and Degeneracy

Neural Information Processing SystemsApr-30-2026, 06:25:45 GMT

Understanding when neural networks can be learned efficiently is a fundamental question in learning theory. Existing hardness results suggest that assumptions on both the input distribution and the network's weights are necessary for obtaining efficient algorithms. Moreover, it was previously shown that depth-2 networks can be efficiently learned under the assumptions that the input distribution is Gaussian, and the weight matrix is non-degenerate. In this work, we study whether such assumptions may suffice for learning deeper networks and prove negative results. We show that learning depth-3 ReLU networks under the Gaussian input distribution is hard even in the smoothed-analysis framework, where a random noise is added to the network's parameters. It implies that learning depth-3 ReLU networks under the Gaussian distribution is hard even if the weight matrices are non-degenerate. Moreover, we consider depth-2networks, and show hardness of learning in the smoothed-analysis framework, where both the network parameters and the input distribution are smoothed. Our hardness results are under a wellstudied assumption on the existence of local pseudorandom generators.

artificial intelligence, machine learning, neuron, (16 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.34)

Industry: Information Technology > Security & Privacy (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Non-Asymptotic Uncertainty Quantification in High-Dimensional Learning

Neural Information Processing SystemsMar-22-2026, 17:00:32 GMT

Uncertainty quantification (UQ) is a crucial but challenging task in many high-dimensional learning problems to increase the confidence of a given predictor. We develop a new data-driven approach for UQ in regression that applies both to classical optimization approaches such as the LASSO as well as to neural networks. One of the most notable UQ techniques is the debiased LASSO, which modifies the LASSO to allow for the construction of asymptotic confidence intervals by decomposing the estimation error into a Gaussian and an asymptotically vanishing bias component. However, in real-world problems with finite-dimensional data, the bias term is often too significant to disregard, resulting in overly narrow confidence intervals. Our work rigorously addresses this issue and derives a data-driven adjustment that corrects the confidence intervals for a large class of predictors by estimating the means and variances of the bias terms from training data, exploiting high-dimensional concentration phenomena. This gives rise to non-asymptotic confidence intervals, which can help avoid overestimating certainty in critical applications such as MRI diagnosis. Importantly, our analysis extends beyond sparse regression to data-driven predictors like neural networks, enhancing the reliability of model-based deep learning. Our findings bridge the gap between established theory and the practical applicability of such methods.

artificial intelligence, machine learning, proceedings, (8 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.82)

Add feedback

A Missing lemmas for the proof of Theorem 3.1

Neural Information Processing SystemsFeb-17-2026, 21:41:52 GMT

The following proof is from Daniely and V ardi [15], and we give it here for completeness. By Lemma A.1, there exists a DNF formula We construct such an affine layer in Lemma A.2. At least one of the k size-n slices in z contains 0 more than once. We define the outputs of our affine layer as follows. Pr [z represents a hyperedge ] = n (n 1) ... (n k + 1) null 1 n null Pr null z Z null 1 2 log(n) .

artificial intelligence, machine learning, neuron, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.51)

Add feedback

Computational Complexity of Learning Neural Networks: Smoothness and Degeneracy

Neural Information Processing SystemsFeb-17-2026, 21:41:48 GMT

Gaussian, and the weight matrix is non-degenerate.

artificial intelligence, machine learning, neuron, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Illinois > Cook County > Chicago (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)
Asia > Afghanistan > Parwan Province > Charikar (0.04)

Industry: Information Technology > Security & Privacy (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

b444ad72520a5f5c467343be88e352ed-Paper-Conference.pdf

Neural Information Processing SystemsFeb-16-2026, 16:32:15 GMT

artificial intelligence, equation, machine learning, (18 more...)

Neural Information Processing Systems

Country:

Europe > Switzerland (0.04)
Europe > France (0.04)
Asia > Middle East > Jordan (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

9adc8ada9183f4b9a007a02773fd8114-Paper-Conference.pdf

Neural Information Processing SystemsFeb-16-2026, 02:31:26 GMT

artificial intelligence, excess risk, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > New York > New York County > New York City (0.14)
Asia > China > Beijing > Beijing (0.04)
Asia > Japan > Honshū > Kantō > Kanagawa Prefecture (0.04)
(3 more...)

Genre: Research Report > New Finding (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Add feedback

LearningaSingleNeuronwithBias UsingGradientDescent

Neural Information Processing SystemsFeb-11-2026, 20:43:12 GMT

Learning a single ReLU neuron with gradient descent is a fundamental primitive in the theory of deep learning, andhasbeen extensivelystudied inrecent years.

artificial intelligence, assumption, machine learning, (19 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.38)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Add feedback

a284df1155ec3e67286080500df36a9a-Paper.pdf

Neural Information Processing SystemsFeb-10-2026, 09:43:05 GMT

Recent approaches include priors on the feature attribution of a deep neural network (DNN) into the training process to reduce the dependence on unwanted features. However, until now one needed to trade off high-quality attributions, satisfying desirable axioms, against the time required to compute them. This in turn either led to long training times or ineffective attribution priors.

artificial intelligence, attribution, machine learning, (17 more...)

Neural Information Processing Systems

Country: